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Amendment* tu the Spec ification : 

Please replace the title with the following rewritten title: 

METHOD AND APPARATUS PGR DYNAMICALLY PROGRAMMING A FIELD 
PROGRAMMABLE GATE ARRAY (PPG A) IN A COPROCESSOR 

Please tmend the pas t rapii beginning ■ page I line 6 as follows 

'Die use of Field Programmable Gate Arrays (FPGA,t on an Application Specific 
Integrated Circuit (ASIC) chip and/or a system-on-a-ehip to provide instruction level hardware 
acceleration and resource sharing is i r r .air i igute I illustrates t co tvei u • I 
apparatus utilizing an PPGA. The apparatus can be an ASIC, a system-on-a-ehip, or some other 
chip comprising a main processor 102, a coprocessor 104, and a local bus. such as the Processor 
Local Bus (FEB) 106 developed, by International Business Machines Corporation™, Other local 
buses can also be used, such as the Advanced Micro-Controller Bus Architecture (AMBA) 
developed by ARM™. The coprocessor 104 comprises PPGA ceils i 1 0 and a plurality of 
interfaces. The interlaces include a programming interface 108, through which the FPGA 1 10 is 
programmed, a PLB interlace 1 12, and an Auxiliary Processing Unit (APU) interface 114. The 
APE interface 1 14 enables special hardware accelerated functions it) be tightly coupled to the 
P ees ! 1 1 if ' c us tt i ( } b \ I i s K jpk [ tiom, the proc ssot 1A 1 

communicates with the coprocessor 1 04 via the PLB 106. fetched instructions inside the 
processor 102 are simultaneously shared with the coprocessor 104 through the APU interface. 
The coeioccv*" !04 -dgnuK the pu cos „ j PP when it .sees a \alid tnsmtctum, m operations 
code ! o,\ixL ) for J ■> execution unit. The coprocessor 104 then performs the requested 
) unction on operands supplied with the instruction and passes the result back to the processor 102 
through the APU interface 1 1 4, 

4<- 



Please amend the paragraph beginning at page 3, line 18 as follows: 

figure I illustrates a convention a i apparatus utilizing an FPGA. 

PK t v i i en t[i pu , i hb nni! l | <. f nt 1 s follows 

Figure 2 illustrates a preferred embodiment of an apparatus tor dynamically programming 
an FPGA in accordance with the present invention. 

Please amend the paragraph beginning at page 3. line 21 as follows: 

Figure 3 is a flowchart illustrating a preferred embodiment of a method for dynamically 
programming an FPGA in accordance with the present invention. 

Pi* a ( wacnd the paragraph beginning at j t I line 1 i folio - 

Figure 4 illustrates a preferred embodiment of a coprocessor with a dynamically 
programmable FPGA in accordance with the present invention. 

Flea tn , ^ - ■ „ • line ^ as follows: 

F igure 5 is a flowchart illustrating in more detail the preferred embodiment of the method 
for dynamically programming an FPGA in accordance with the present invention. 

Please amend the paragraph beginning at page 4. Sine ? as follows: 

Figure 7 illustrates a second preferred embodiment of the apparatus for dynamically 
programming an FPGA in accordance with tire presem invention. 



Please amend the paragraph beginning at page 4, hue 2i as follows: 

[ n ■ * i in, r.,.,,i jvcle )0<f .mhudimt-ntoUmapp^arus tot .'nun h IK p 'Mian m iv 
an I Pt < \ m accoidau^ vuth the ptcM nt msenuon I ho apparatus, eompi *s a n am pieces' o, 
202. a coprocessor 204. and a Process 1 oca] Bns (Pi B) 206 as the local bus. uhei types oi 
I < VX s{ t - a \- njn !»_«.) 1 . "J . uk x S " M - 

T n„ imm main proc essor 202. The processor 202 comprises an exception subroutine 
T\T which 1 1 de^u .he I lu diei belo^ 1 he cnpiotev^i 204 composes a dy.wune 
programmable FPGA 21.0, a programming inter face 208 for programming the FFGA 210, a PLB 
interface 212, and an APU interface 214. The apparatus farther comprises amemory 216 for 
storing configuration bit streams 220 for various functions thai, can be requested by applications 
executed by the processor 202, The memory 216 can be embedded on a chip along with the 
processor 202 and coprocessor 204 or external to the chip. According to the present invention, 
during the execution of an application by the processor 202, if a function requested by the 
application had not been programmed into the FPGA 210, the configuration bit stream for the 
function can be fetched from the memory 216, and sent to the programming interlace 208 via a 
programming channel 2 ! 8 coupled to the PLB 206, The FFGA 21 0 em is then programmed with 
the fetched configuration bit stream. Once programmed, the function requested by the 
application can be performed by the coprocessor 204. In this manner, the FPGA 21.0 m /is 
programmable "on the lV\ i.e., dynamically during the execution of an application. 

Please amend the paragraph beginning at gage 5, line 17 as follows: 

t mure 3 is a flowchart illusuatmy > nckue 1 eral < dit lent 1 1 a iiethud ft \ ihna meafo 
pjoojmmm. rtHf V in iccordance with the present i vennon Mrs the execution o1 an 
application b> me pro. >soi 202 h ins, Ga step 302. Next he coproce^ci 2u4 icceisc- an 



infraction from the pieces^* .?o: tu pctfoim function m he applnaUon \ ia step 304 In tht 
pre leu cd embodiment the m>rm!roti is sail tu the \PU interface M4 ot fie cop.<«.c->ot 2(>4 
Next, it is determined that the FPGA 2 10 of the coprocessor 204 is not programmed with trie 
logic ibr the requested function, x ia step 306. In the preferred embodiment the VP! imei ! kv 
214 i turns an error when the FPGA 2 1 0 are is not. pi gran mod will v E tlmmthe 

function. 1 hen. the processor 202 fetches the configuration bit stream 220 for the function from 
the memory 21o via step 308. The '{- PGA 210 are is then programmed nia^i.hi'u with the 
configuration bit stream 220, via step 3 ? 0. 

Please amend the paragraph beginning at page 6. One 6 as follows: 

In the preferred embodiment, when the processor 202 receives she error from the API) 
interface 214, the processor .202 initiates the exception subroutine 222, which fetches the 
configuration bit stream 220 for the function from the memory 2 1 6. The bit stream 220 is then 
sent to the programming interface 208 via the programming channel 218. In the preferred 
embodiment, the programming channel 21$ is a Direct Memory Access (DM A) channel. The 
exception subroutine 222 of the processor 202 would point the DMA channel to the location in 
the memory 216 where the configuration bit stream 220 ibr the requested function is stored. !h 
1A1 \ then left Kl thee nhju m i b t >tr, n i 220 and vaik^m !lv ui ii i a-* n^eanGA 
into the FPGA 210. Other types of programming channels can be used without departing from 
the spirit and scope of the present invention. For example, input/Output (I/O) register logic on 
the PLB 206 can be used, through which the processor 202 can program the FPGA 210 directly. 



Please amend the paragraph beginning at page 6, line 1 7 us follows: 

i muK 1 fl istmuN tneFiKi Omop! h fi am x^mji AH u ith a d^ am > 



programmable FPOA in accordance with the present invention. The coprocessor 204 comprises 
a pl urality of coprocessor state machines (CSM.1 -CSM3). Each coprocessor state machine 
(CSM) represents a function type and manages one or more coprocessor managed resources 
i k MP t 4 MA'! i h iopi iu oi n t \n_^d resounds o MR) repiesent the ham inside ihv 
PPGA 210 for each coprocessor function managed by its corresponding CSM An instruction 
received by the APU interface 214 describes the instruction type and may contain additional 
mfotmiiumu! n mmemm I h< \H interlace 2 14 founds tb«. mMmumo to thw ippiopuate 
eSMbu.vd «n the justiuetmi H>pc ) he < ^M then e\euue> th*. apmoptun l MR Uu the 
requested function, if noire of the CMRs were pupu earned w ah the uqucted function, the 
APU interlace 214 returns an error to the processes 2<>2 ! he « on figuration hit stream los the 
requested function is fetched, and the appropriate CMR is programmed with the configuration bit 
stream, .a described abo\ e 

Please amend the paragraph beginning at page 7, hue 8 as follows: 

For example, assume thai the application issues an instruction to the coprocessor 204. 
requesting thai a specific event timer function be performed, via step 302, Tire APU interface 
214 iecem.Nthc msttuAmn an I mmafd a fo,o hA I n gjoa >n u ( Mk i « SM] sbMi 
manage^ e\cnt tiinci functions t SMS t MPJ Jeiomures that the hygc for the requested eunf 
timer ttrmtum hm not heen pmc. 1 tn t i in en < i t MR t MRM. \i step A>0 11k \Pt 
interface 214 returns this error to the processor 202. The processor 202 then executes the 
exception subroutine 222. The exception subroutine 222 fetches the configuration bit stream for 
the tequcskd cum tumm Summon Uom memon 21e and 'ends pie eonfummtion hit stu mi to 
fi e pun ramming inter! ce 20S One ol the i MR s is then programme 1 w ith the appropriate 
configuration bit stream. 

-6- 



Please amend the paragraph beginning at page ?, Line 18 as follows: 

Figure 5 is- a.flowcbart illustrating in more detail the preferred embodiment of the method 
M d nuutam'h " commm ng ai U'< \ n n^, » wnh the pu^ent mwmU n Dunng 
execution of an application, the processor 202 fetches an instruction, via step 50.2. The 
instruction is sent to the APU interface 214, via step 504, if the instruction is for the coprocessor 
204, the A PU interlace 21 4 issues a commit; via step 506. If not, the APU interface 214 
processes the next instruction on the processor pipeline, via step 508 if the APU interlace 214 
commits, then the APU interface 214 attempts to execute the function. If the function has been 
programmed into the PPGA 210, then the PPGA execute the function, via step 516. If the 
coprocessor 204 can then respond immediately with a value in the results register, via step 5 1 8, 
then the result is returned, via step 522, and the instruction completes, via step 524. If the results 
cannot be returned immediately, then the function requires more clock cycles to complete, aid 
the processor pipeline is held for the result, via step 520. 

Please amend the paragraph beginning at page 8, hue 12 as follows; 

Figure 6 is a flowchart illustrating in more detail the programming of the PPGA with the 
desired function m accordance with, the present, invention. If the function has not been 
programmed into the PPGA 210, the coprocessor 204, and more specifically the APU interlace 
214, provides an exception code in a results register, via step 002, The processor 202 responds to 
ihe.exception ( xfc bs n n I ng to t.h< exceptioj ibroi int 222 via step 604 fhe exception 
subroutine 222 decodes the function identifier passed to it. via step 606. The exception 
subroutine 222 then requests and is granted o wnership of the required function by the 
programming channel 218 across the PUB 206, via step 608. The processor 204 then fetches the 



configuration bit stream 220 for the function from the memory 216, via step 610. The exception 
sabtoutmc ' ^kunoe- the e^epu -n !\p L md the u-pnc sn >- liMt J uon rvj m.i step f-12. 
11k. t ,puon th<h!tm> " k th^ t re 1 ioi r a seaue u e + load .ion. imti iu tRUJ*. \ mthe 
programming channel 218 and the programming interface 208 to program the FFGA 210 with the 
configuration bit stream, via step 614, The processor 202 then reissues the instruction, via step 
616. 

Please amend the paragraph beginning at page 9. line 9 as follows: 

Fiaurc 7 illustrates a second preferred embodiment of the apparatus for dynamically 
programming an FFGA in accordance with the present invention. In this embodiment, a plurality 
of processors 702. 708 , and 712 share system resources, such as program memory (no t shown) 
and coprocessors 704, 710, and 714. Each coprocessor 704, 710, and 714 acts as a Shared 
Resource Agent and each is attached to the VLB 206 through a Shared Resource Interface (SRI) 
706, 7 10, and 716, respectively. Coupled to the PLB 206 is a Shared Resource Manager (SRM) 
718. The SRM 7)8 guarantees data consistency is maintained between the shared resources by 
making sure the right to modify a given resource is only given to one processor at a time. Each 
processor 702, "OS T and eoptoces,-ors *"", ) 4 can nakpeodeuth access the shared 
svst em resources through their respective SRI 706, 710, 716, with their access controlled by the 
SRM 'IS I he programri it g oi the I PG Vs in each oi tl e coprocessors 704 710, 714 is 
peik imed m Ox s » re tnannet a, d s. ih d iho\ i c ,ee^ the M<\ lb can seise as the 
programming channel 2 ! 8, and the instructions fetched by a processor 702. 70S, 7 1.2 can be sent 
in the FPU \s uk-! si ynjefam f the coprocessors 704 0, "11 ! he SRM ~ ! 8 cm program 
ikkbuKon fid aim trann Dniou. iidi'U'^d" ^ » i n 'Q vol u pum^"> "04, 
no 7U \henasf\cb the eacit pn , -s i ■ . kyS 4_umup - ^ituc mm hcSRM 



718, L t l M\\ memo* i'K u ^ U > >«. w U pto^s g > i 

the Oil ■'•PP^i !!J!0 ! l'l t \ 

Please amend the paragraph beginning at page 1 0, line 4 as follows: 

Another exception condition can occur when the requested Junction must be loaded into 
cnpjocessoi bui there is not enough unused logic resources in the I Pti V assoeuted >\_nh the 
uipox.^oi to piognjin the neo UJin In inn nuan n n, n eiU! * I '^cd < 1 R I » 
algorithm can be tmpl mu> d u hef m the .optoce-soi o as a fin id tunning on the processor 
\ 1 kl function i na u <et struck of the motion loaded! t< the eoproct sors and how 
often each has been used in a given period of time. Tins LRU function manager can be queried 
to determine which old function can be disabled to free up logic resources to load the new 

lu-uoi ,i he cuproccbsoi can act as a cache ol unctions lot tlx pok ^ » H.is 

library of functions is. managed by the SRM 718. The exception processing of the processor is 
the same as described above except for the added delay caused by the LRU function manager 
operation. 

Please amend the paragraph beginning at page 10, line 15 as follows: 

A method and ap irao lot iynaraical pre ramm.i.M m FPGA have been disclosed 
The method includes beginning an execution of an application by the processor; receiving an 
m-l ui tier Pom i i_ pnxe^ « n the uui > c >m i pu\>ru • ha a. on h>! tiu tppb.au.' i 
determining that the FPGA in the coprocessor is not programmed with logic for the function: 
fetching a configuration hit stream tor the function; and programming the LPGA, with the 
configuration bit stream. In this manner, the FPGA is programmable %n the fly" i.e.. 
dynamically during the execution of an application. Logic flexibility and space savings on the 
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chip comprising the coprocessor and processor arc provided as well. 
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